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Abstract 

Suppose that Alice and Bob are given each an infinite string, and they 
want to decide whether their two strings are in a given relation. How 
much communication do they need? How can communication be even 
defined and measured for infinite strings? In this article, we propose a 
formalism for a notion of infinite communication complexity, prove that it 
satisfies some natural properties and coincides, for relevant applications, 
with the classical notion of amortized communication complexity. More¬ 
over, an application is given for tackling some conjecture about tilings and 
multidimensional sofic shifts. 
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1 Introduction 

In this article, we are interested in introducing a generalization of Communica¬ 
tion Complexity [mil] to infinite inputs. The motivation comes from the theory 
of tilings and regular languages of infinite pictures, specifically to express that 
two neighbour cells can exchange only a finite information. 

In this setting, Alice and Bob have a binfinite word x and y. We think of this 
input as an infinite array of cells, each containing a symbol, the cell Xi having 
a channel of communication with the cell yi. We are looking at decentralized 
protocols, that is each decision must be made locally. These precautions are 
mandatory to avoid unrealistic protocols. As an example, a protocol that sends 

2 bits in each channel can be simulated by a protocol that sends only one bit 
in each channel: instead of sending, in each channel i, two bits ui and Vi, just 
send the bit Ui in channel 2i and the bit Vi in channel 2 j -|- 1. Our definition will 
implicitely forbid such a protocol, and provide a meaningful definition of ICC. 
This situation is similar to what happens for an other model of communication 
complexity with infinite inputs called algebraic communication complexity [TJ 
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IIS], where coding two real numbers into a single real number should also be 
forbidden. 

We will focus in this article only on nondeterministic communication com¬ 
plexity. It is the most natural from the point of view of applications, and is also 
the easiest to define. We postpone definitions of deterministic (and probabilis¬ 
tic) communication complexity to further articles. 

Once the definition is given, we will see that many propositions from finite 
communication complexity can be translated, with different proofs, to the in¬ 
finite setting. We will prove in particular that this new notion of complexity 
coincides for relations with what is known as amortized complexity [5]. Amor¬ 
tized complexity asks for the best protocol to decide whether n pairs {xi^yi) 
belong to some relation i?, for large n, while infinite complexity asks for the 
best protocol when the number of pairs is infinite. It is natural that these 
quantities should be equal. 

While the definition is interesting in its own right, the main motivation comes 
from the theory of tilings, where the concept is quite natural when dealing with 
regular languages of infinite pictures. In the last section we will see how this 
new tool gives us new insights into this theory. 

2 Preliminaries 

2.1 Communication Complexity 

We introduce here the formalism of (nondeterministic) Communication Com¬ 
plexity, with a few innocuous adjustments that will make the transition to infi¬ 
nite communication complexity easier. The first chapters of [5] are recommended 
readings. 

Let S C A X y be a binary relation (in communication complexity, we 
usually think of S' as a boolean function). Alice is given x € X and Bob is 
given y G Y and they want to know whether {x, y) G S. The communication 
complexity of S is the number of bits that Alice and Bob need to exchange to 
decide whether (a;, y) € S. 

In nondeterministic communication complexity, Alice and Bob are allowed 
nondeterministic choices, and must succeed only if (x, y) G S. As in the defini¬ 
tion of NP, an alternative definition can be given in terms of proofs: Alice and 
Bob are both given nondeterministically a “proof” z that {x, y) G S and each of 
them uses z to verify that indeed {x, y) G S. This is defined formally as follows: 

Definition 2.1. A nondeterministic protocol for a relation S Q X x Y is a 
tuple {Z, Sx, Sy) where Sx Q X x Z, Sy CYxZ and 

(x, y) G S <;=> 3z G Z, {x, z) G Sx, {y, z) G Sy . 

The size of the protocol is log|y|. The nondeterministic communication com¬ 
plexity of S, denoted Af{S) is the minimal size of a protocol for S. 

We now give a few examples that will be useful later on. 

• Let X = Y = {0,1}” and EQ = {{x,y)\x = y}. Then Af{EQ) = 0{n). 
Intuitively, Alice sends all of her bits to Bob. (Formally, take Z = {0,1}" 
and Sx = Sy = EQ). It can be proven that this bound is tight: J\f{EQ) = 
n. 
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• Let X = Y = {0, !}"■ and NEQ = {{x,y)\x ^ y}. Then we have 
J\f{NEQ) = O(logn). Intuitively, Alice chooses i and sends both i and Xi 
to Bob, who verifies that yi Xi. This takes logn + 1 bits. (Formally, take 
Z = |l,n]x{0,1}, Sx = {(a;, (*,a))|a;i = a} and Sy = {{y, {i,a))\yi ^ a}). 
It can be proven that this bound is almost tight: N{NEQ) > logu. 

2.2 Symbolic Dynamics 

We will now generalize this definition to take into account infinite inputs, that 
is inputs in A* for some finite set A. Choosing biinfinite words rather than 
infinite words is of no consequence but simplifies the exposition. 

The idea is that Alice and Bob both have an infinite word as input, which 
can be thought of as an infinite collection of cells. There is a channel of com¬ 
munication between the cell i of Alice and the cell i of Bob, and Alice and Bob 
should use these channels to communicate. 

Now we have to be careful with the exact definition of a protocol. Consider 
the case of the problem EQ where X = Y = A* for some finite alphabet A. 
The “optimal” protocol for EQ should be for Alice something like sending Xi 
through the f-th channel f, and for Bob to compare the output of its i-th channel 
with yi. This would use log |A| bits per channel. 

However, other protocols are possible: A* is in bijection with {0,1}* so 
another protocol would be for Alice to transform its word x G A* into a word 
f[x) G {0,1}*, sending f{x)i through the i-th channel, then for Bob to apply 
on the whole word it receives through all channels, then compare the output 
with y. This would use only 1 bit per channel. 

This protocol is of course not what a good protocol should be and it will 
forbidden by the definitions. We will ask for all cells of Alice to act in the exact 
same way, and for the communication on a given cell to depend only on finitely 
many cells of the input. 

The best way to formalize all this convincingly is with the vocabulary and 
the formalism of symbolic dynamics. Indeed, the first property corresponds to 
an invariance by translation, and the second property to a continuity argument, 
both being central in the study of symbolic dynamics. We refer the reader to 
[3] for a good introduction to this domain. 

Using the definitions below, we will be able to answer the three following 
questions: 

• What relations S should be considered ? 

• What should be Z, Sx, Sy ? (What is a protocol ?) 

• How do you measure the size of a infinite set ? (What is the complexity 

?) 

The basic infinite sets we will be considering are called subshifts. Formally 
speaking, a subshift is a topologically closed subset which is invariant by the 
shift map a, defined by a-{x)i = Xi+i for x G A* and i € 1^. This encom¬ 
passes both desired properties: Every cell behaves the same (shift-invariance), 
and operations depend on finitely many cells (continuity, here in the form of 
closedness/compactness). We will use here the following, equivalent, definition: 
A set S' C A^ is a subshift (or simply shift) if there exists a set of words E over 


3 


A so that S is exactly the set of infinite words that do not contain any pattern 
in as a factor. 

For example, the set of biinfinite words over A = {a, b} that contains at 
most one symbol 6 is a subshift, corresponding to = {ba^b,n S N}. 

Intuitively, Alice can semi-decide if an biinfinite word w is in a subshift S: 
on each cell i, the same program is executed that reads continuously the letters 
around i, and fails if it sees a forbidden pattern. If there is one, some cell will 
fail at some time t and every cell will fail at some time. 

This might seem to be too powerful (Alice doesn’t even need to be com¬ 
putable in this definition). There are various classes of subshifts that might be 
expressed in terms of restriction of Alice’s power. We will see the class of sofic 
shifts later on, but we will focus here on shifts of finite type. A subshift S is 
of finite type if it can be given by a finite set of words of some size n. This 
corresponds to the case where in each cell i the same program is run for a given 
time t (reading the content of cell i and adjacents cells) and then the word is 
accepted if none of the programs has failed by time t. 

With this formalism, we can now describe what a relation and a protocol 
are. It remains to dehne how to measure the size of the sets. The good notion 
for this is entropy [S]. Informally, a subshift S has entropy logc if it has Vl{c^) 
differents factors of length n. 

Formally, if we denote by c„ the number of different words of size n of S, 
then the entropy H (S) can be defined by: 

H(S) = . 

n n 

As an example {0,1,2... c}^ has entropy log c. The entropy is a good notion 
of complexity, as is made clear by the following remarks. First, if S' C S" then 
H{S) < H{S'). Second, if S maps onto S', then H{S) > H{S'). More precisely: 

Definition 2.2. A block code f is a continuous, shift-commuting map f : S ^ 
S'. For such a map, H{f{S)) < F[{S). If f is one-to-one, then H{f{S)) = 
H{S). 

We say that S factors onto S' (and that S' is a factor of S) if there is an 
onto block code f : S ^ S' (also called a factor map) If f is also one-to-one, f 
is called a conjugacy, and we say that S and S' are conjugated. 

In particular, if S' is a factor of S then II{S') < II{S). If S' is conjugated 
to S then H{S') = H{S). 

These few properties mean that reasoning on the size of finite sets may be 
translated easily into statements on entropy of shifts. A notable difference is 
that i/(S) < II{S') does not imply that there is a one-to-one map from S to 
S', or an onto map from S' to S. 

2.3 Definition 

We are now ready to define nondeterministic communication complexity: 

Definition 2.3. A nondeterministic protocol for a subshift S C X x Y is a 
tuple {Z, Sx, Sy) where Sx C X x Z, Sy QY x Z are subshifts and for all 
{x,y) eX xY, 

{x, y) € S <;=> 3z G Z, {x, z) e Sx, (y, z) G Sy ■ 
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The size of the protocol is H{Z). The nondeterministic communication com¬ 
plexity of S, denoted Af{S) is the infimum of the size of a protocol for S. 

As explained above, the definition mirrors the one in the finite case, replacing 
“finite set” by “subshift” and “size” by “entropy”. 

We will see in the next section that this is the good definition to adopt, as 
natural and obvious statements will indeed be true. 

In the remaining of the paper, we will assume that if A C A1 x y is a 
subshift for which we want to compute the communication complexity, then 
X = {x|3i/ € Y, {x,y) G S} (that is the map A —> X is onto), and similarly 
for Y. This is an innocuous hypothesis but necessary for theorems below not 
to fail for stupid reasons. Many statements also assume implicitely that S is 
nonempty. 

With these hypotheses, a protocol {Z,Sx,Sy) entails a few maps. Denote 
by L the set of triples (cc, y, z) such that z is a protocol for (x, y), that is (z, x, y) 
satisfies simultaneously z G Z, (x, z) G Sx, (y, z) G Sy- 

Many properties below may be deduced from the following diagram: 



^ ^ ( 1 ) 



By definition of a protocol, TIxxy is always onto, and we may suppose 
without loss of generality that the three other maps involved in this diagram 
are also onto (hence factor maps). This diagram may be completed by maps 
from/to Sx and Sy but they will not be needed explicitely in the following 
sections. 

3 Properties 

In this section, we will give a few properties of the infinite communication 
complexity. 

First a few obvious properties: 

Proposition 3.1. Let S C X x Y. 

• A/’(S') > 0 (unless S is empty); 

• M{S) < miTi{H{X), H{Y)); 

• If X' and Y' are subshifts, then J\f{S fl {X' x Y')) < J\f{S). 

Proof. • Let Z he a. protocol for S. If Z is empty (and then H{Z) = —oo), S 
will be empty. Otherwise, Z is nonempty and then H{Z) > 0. Therefore 
MiS) > 0 . 

• A/’(S') < H{X) is clear: Take the protocol where Alice sends her input to 
Bob. Formally take Z = X, Sx = {(x,x)|x G X} and Sy = S. 
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• For the last item, it is clear from the definition that a protocol for S may 
be transformed into a protocol for S PI {X' x Y') by changing only Sx and 
Sy- 

□ 


The first obvious example requires no communication: 

Proposition 3.2. (If X and Y are nonempty,) Af{X x F) = 0. 

Proof. Alice sends something to Bob, independently of her input. Formally, 
take Z consisting only of the periodic point ...000... {Z is of entropy 0), 
Sx = X X Z and Sy = YxZ. This proves Af{X x F) < 0. Entropy is negative 
only when Z is empty, hence ffiS) > 0. 

□ 

The first interesting example is equality: Give Alice and Bob each a word x 
and y, and decide ii x = y. 

Definition 3.3. IfT is a subshift, then EQt = {{t,t)\t € T}. 

Proposition 3.4. N{EQt) = H{EQt) = H{T). 

Proof. N{EQt) < H{T) is clear: Just take the protocol where Alice sends its 
input to Bob. 

Conversely, let {Z,Sx,Sy) be a protocol for EQy. It is clear that in Dia¬ 
gram [U the map II^ should be one-to-one: A word 0 cannot be a protocol for 
two different pairs {x,x) and {y,y) as this would imply {x,y) € EQy. As Hz is 
one-to-one, this implies H{Z) = H{L) > EQy. □ 

We will now give three other proofs of the previous proposition, introducing 
other methods to give lower bounds for Communication Complexity. 

First is the well-known method of fooling sets. 

Definition 3.5. Let S Q X x Y be a subshift. 

A fooling set is a subshift ECS sueh that: For each (x, y) G E, there exist 
at most countably many pairs {x',y') € E so that (x,y') G S and {x',y) G S. 

The usual dehnition in the finite case replaces “at most countably many 
pairs” by “no other pair”. In the infinite case, we can obtain a stronger state¬ 
ment. 

Theorem 1. Let E be a fooling set for S. Then Af{S) > H(F). 

As EQt is a fooling set for EQt, this gives a proof of the previous propo¬ 
sition. 

Proof. Let {Z,Sx,Sy) be a protocol for S. Let Lp he the restriction of L to 
tuples (z, X, y) where {x, y) G E. We now look at the diagram: 


IIxxF 

Hz 

t- 

Lp —^ 

GH 

{z,x,y) 


We suppose wlog that the map II^ in the preceding diagram is onto (replace 
Z by f{Z)). Now the hypothesis implies that each z & Z has only countably 
many preimages. This means that the map IIz is a countable-to-one factor map, 
which implies that H{L) = H{Z) [TU]. As II^ is onto, we have H{Z) > H{F). 
The result follows. □ 
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We now relate the communication complexity with the largest set that can 
be extracted simultaneously from X and Y. 

A common factor of X and y is a factor F from X and Y, by maps ft and 
if such that the following diagram commutes: 



Theorem 2. Let S C X x Y. If F is a common factor of X and Y, then 
M{S) > H{F). 

More precisely, if Z is a protocol for S, then Z factors onto F. 

As T is a common factor of EQt, this gives again a new proof of the propo¬ 
sition. 

Proof. Let 9 = cjXix = 'ifH-Y '■ S ^ F denote the common map and (Z, Sx, Sy) 
be a protocol for S. 

Recall the following diagram, where we assume wlog that II^ is onto. 



TLxxy 

Hz 

s 

G- 

L -G 

{x,y) 

G-i 

{z,x,y) 


9 can be lifted to a map 9 from L to F hy 9 = 9IIxxy- Now it is easy to see 
that 9 depends only on z. Indeed, suppose that (x,y,z) S L and (x',y',z) G 
L. By definition of a protocol, we also have {x,y',z) G L. Now 9{x,y,z) = 
(j){x) = ifiy), 9{x^y',z) = 4>{x) = f^iy'), and 9{x',y',z) = (j){x^) = from 

which it follows 9{x,y,z) = 9{x',y',z). This means that h = 9IV^^ is actually 
a function, which is obviously shift-invariant, onto, and standard topological 
arguments show it is continuous. Hence we have a factor map from Z to W, 
and H{Z) > H{W). □ 

An additional lower bound can be made easily, using the notion of conditional 
entropy. We adapt slightly the definitions from [3] to make them work better 
in our context. 

Let S' C A X y be a subshift. For x G X, let S“^(a:) be the set of words y 
such that {x,y) G S and Cn{x) be the number of different words of size n that 
can appear in position [0,n — 1] in a word in S~^{x). 
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Then we can define: 


fls(yw = iim8upfilfd£); 

n 

Hs{Y\X) = sup Hs{Y\x) . 

x£X 

Hence Hs(Y\X) measures somehow how many different words y may corre¬ 
spond to a given word x € X. Let us give an example. Let LEQ = {{x, y) G {0,1}^ x {0, l}^|Vz, Xi < yi}. 
Then HLEQiY\x) = log2 if a: = • • • 000 • • •, HLEQiY\x) = 0 if x = • • • 111 • • •, 
and Hleq{Y\x) = log2/2ii'\/i,Xi = i mod 2. It is easy to see that iLL£;Q(y| X) = 
log 2. 

Theorem 3. Let S C X x Y. Then Af{S) > H{Y) - Hs{Y\X). 

For S = EQtj Hs{Y\X) = 0, which gives again a new proof of the proposi¬ 
tion. 

Proof. First let us detail the proof in the finite case. In this case s(F|X) denotes 
the maximum number of different y that can be associated to a given x. 

Let {Z, Sx, Sy) be a protocol. Now we can enumerate Y like this: first 
choose some z arbitrarily. For this z, choose a unique x so that (x,z) G Sx- 
Then enumerate all words y such that {x, y) G S. Now the number of y that 
are enumerated for a given z is less than s(y|X), hence |y| < |Z||s(y|X)| 

Now, for the infinite case. Let {Z, Sx, 5^) be a protocol. By properties of the 
conditional entropy [1], H{Sy) < Hsy{Y\Z) + H{Z). Recall that Hsy{Y\Z) = 
sup-,g 2 Hsy {Y\z) and let z £ Z. There exists x such that (x, z) G Sx- Now if y 
is such that {y,z) G 5”^ then (x, y) G S. This implies that (y|z) < Hs{Y\x)- 
Hence Hsy{Y\z) < Hs{Y\X), and finally HsyIy\Z) < Hs{Y\X). 

We obtain iL(S'v) < Hs{Y\X)+ H{Z), hence H{Z) > H{Sy) - Hs{Y\X) > 

H{Y)-Hs(Y\X). □ 

We now go back again specifically to the equality example. If we look at the 
propositions above, we can conclude that if {Z, Sx, Sy) is a protocol for EQt, 
then there exist Z' C Z and a factor map from Z' to T. 

We look now at T = {3,4}^ U {5,6}^. T is of entropy log 2. However, 
there is no factor from Z = {0,1}^ to T. That is, there are no protocols 
{Z,Sx,Sy) for EQt where Z = {0,1}^. This means that having a protocol 
with H{Z) = log 2 is not the same as having a protocol with Z = {0,1}^. This 
is however an artifact of protocols of exact communication complexity and this 
does not happen otherwise, as illustrated as follows. 

We now introduce a class of shifts, parametrized by P, called the /3-shifts 
[mils], which have the following properties: The /3-shift has entropy log /3, and 
for /3 G N \ {0,1}, the /3-shift coincides with the full shift {0,1.. ./3}^. The 
exact definition is not important, but let just note that the /3-shift corresponds 
somehow to numeration in the (possibly nonintegral) base /3. 

Then we can prove 

Theorem 4. Let S C X xY be a subshift. 

For any /3 > .Af{S), there is a protocol {Z, Sx, Sy) where Z is the ft-shift. 

The preceding discussion shows that the result is not true for ft = J\f{S). 
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Proof. The idea is to use Krieger’s embedding theorem [5], which states that any 
subshift S can be embedded into a subshift T provided that T has bigger entropy, 
has more periodic points, and satisfies a technical condition (be a mixing SFT). 
The set of /3 for which this technical condition is true is dense in [1, +oo[ [TT| . 

Let P > M{S). By definition of M{S), there exists a protocol {Z,Sx,Sy) 
for S where H{Z) < p. 

By changing Z to a, product of Z and a Thue-Morse shift, we may assume 
wlog that Z has no periodic point. 

Now we can find H{Z) < P' < P so that the /3'-shift is a (mixing) SFT. As 
Z has no periodic points, we can use Krieger’s embedding theorem to embed Z 
into the /3'-shift, hence into the /3-shift. 

Now we have obtained a protocol {Z, Sx, Sy) where Z is included in the 
/3-shift. Replacing Sx by Sx ri {A x Z), and similarly for Sy, we may replace 
Z by the whole /3-shift and then obtain the theorem. □ 

4 Amortized Communication Complexity 

In this section, we give a link between infinite communication complexity and 
asymptotic communication complexity. Let R C X x Y he a relation. We 
denote by i?" C A” x Y" the relation (x, y) G i?” V* < n, {xi, yi) G R. 

Definition 4.1 ([5]). The asymptotic (amortized) communication complexity 
of R is 

=lim ~^(^") . 
n n 

With the same notation we denote by R^ C x the relation (x, y) G 
jfz. g {xi,yi) G R. 

Theorem 5. J\f{R^) = 

In other words, the asymptotic complexity is the same as the infinite com¬ 
plexity. 

Before proving the result, we need a related proposition, that states that 
subshifts with simple description admit protocol with simple descriptions: 

Proposition 4.2. Let S C X x Y be a subshift of finite type. If {Z, Sx, Sy) is 
a protocol for S, then for any e, there exists a protocol {Z', S'x, Sy) for S where 
Z', S'x and S’y are also subshifts of finite type. 

Proof. Let {Z, Sx, Sy) be a protocol for S. 

Z, Sx and Sy are defined by families of forbidden patterns, denoted by Z"^, 
Sx and Sy, the subshifts forbidding only the first n patterns. Hence Sx = 
DnSx, and similarly for Z, Sy. 

If we do the same protocol with Sx instead of Sx, we will recognize a 
superset of S that we call S'". Let us prove S = n„S”, one inclusion being 
obvious. If {x,y) G n„S", then there exists Zn G such that {x,z^)€S^ and 
{y,Zn) G Sy. By compactness, the sequence {Zn) admits a limit point z G 
that satisfies {x,z) G Sx and {y,z) G Sy, hence {x,y) G S. 

But S is supposed to be of finite type, so defined by finitely many patterns. 
At some point no all these patterns will be forbidden, that is S = n„<TipS". 


9 



But this means that for all n > no, {Z^, S'^^Sy) is a protocol for S', for which 
all subshifts involved are of finite type. 

Now, as entropy is upper-semicontinuous for shift spaces, H{Z) = lim„ H{Zn), 
hence for n > uq big enough, we will have H{Zn) < H{Z) + e which proves the 
proposition. □ 

Proof of Theoreml^ • First, let us prove J\f{R^) < {R). Let e > 0 

and let {Z,Sx^,Sy^) be a (finite!) protocol for i?” of complexity at most 
nAf‘^^v^P{R) + ne, that is log jZ] < nA/'“*J^™P(i?) + ne. We build a protocol 
for R^ as follows. 

Let Z' be the subshift over the language Z U {_L} defined as follows: a 
word w is in Z' if and only if every factor of w of length n contains exactly 
one letter in Z. In other words, a word in Z' contains a letter in Z, then 
n — 1 symbols _L, then a letter in Z, ad libitum. It is clear that Z' is of 
entropy < M‘^^v^p{R) + e. 

We now define Sx^ as follows: {x,z) G Sx^ if and only ii z € Z' and, if 
we denote hy I = i + nh the positions in z where the letter is not _L, then 
for all j, {xi+jnXi+i+jn ■ ■ ■ Xi+n-i+jn,Zi+jn) G Sx^- We define Syi in the 
same way. 

It is clear from the definition that we obtain this way a protocol for of 
size at most (R) + e, which gives the result. 

• M{R?‘) < Let e > 0 and let {Z, Sx^, Sy^) be a (infinite) 

protocol for R^. As R^ is of finite type (it is defined by forbidden patterns 
of size 1), we may suppose by the previous proposition that the protocol 
is of finite type. 

Let L as above be the set of tuples (z, x, y) such that z G Z, (a;, z) G 5'x*, 
(y, z) G Syz. L is a subshift of finite type (it is the intersection of three 
subshifts of finite type), hence can be defined by a finite set of forbidden 
words, say of size r. 

Let n > r. We now describe a protocol for i?". On input (x, y), we send 
to Alice and Bob a word z of size n that is valid for Z, and Alice (resp. 
Bob) sends to Bob (resp. Alice) its first and last r letters. Now Alice 
looks whether the pattern (x, z) appears in some valid word of 5'x*, and 
whether the two patterns of length r that Bob sent to her appears in some 
valid word of L. In this case, Alice accepts. Bob does the same. 

Now, if (x, y) G i?", it is clear that the above protocol works: just complete 
{x,y) to obtain an infinite word {x°°,y°°) with {x,y) in its center in R^, 
take z°° to be the word that proves that {x°°,y°°) is indeed valid, and 
take for z the central n letters of z°°. 

Conversely, suppose that {x, y) is accepted by the protocol. We now look 
at the word (x, y, z) of size n we obtain. By construction, this word does 
not contain any forbidden word of L. Furthermore, its first and last r 
letters appear in some (possibly different) infinite words of L\ this means 
that we can complete it into an infinite word in L. This proves that there 
exists an infinite word (a:°°, i/°°, z°“) in L with (x, y, z) at its center, hence 
that (x°°,y°°) G which implies that (x,y) G i?". 
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If we denote by c„ the number of words of Z of size n, then this protocol 
uses logc„ + 41ogr bits, that is < logc„ + 41ogr, therefore 

^ = H{Z) < + 6 . 

n n n n 

□ 


5 Application to 2D languages 

As hinted above, the main motivation comes from the theory of 2D languages 
and in particular the definition of regular languages of infinite pictures, which 
are called sofic in this context. 

The best way to define sofic (2D-)shifts uses the well-known concept of Wang 
tiles from tiling theory, as introduced by Hao Wang [14]. In our context, a Wang 
tile is a unit square with colored edges and some symbol a; S E at its center. 
A tiling by a finite set r of Wang tiles associates to each point of the discrete 
plane 1? a Wang tile so that contiguous edges have the same color. By looking 
at the symbol at the center of each tile, a tiling by r gives rises to an infinite 
picture w G E^ . The sofic shift defined by r is then the set of infinite pictures 
we obtain this way. 

A first example is presented in Figured) This set of Wang tiles produces a 
lot of different tilings but only two different pictures (up to translation): one 
with the symbol 0 everywhere, and the other one with only one occurrence of 
the symbol 1. Hence the set of pictures over the alphabet {0,1} containing at 
most one occurrence of the symbol 1 is a sohc shift. 

The main question we want to tackle is a way to decide whether a given 
set of infinite pictures is indeed a sofic (2D-)shift. For one-dimensional shifts, 
where sofic shifts can be defined in a similar way, a shift S is sofic if and only if 
the set of finite words it contains is regular. 

Proposition 5.1. Let S be a two-dimensional shift over an alphabet A. 

Forn an integer, denote by Ln{S) the set of infinite words over that may 
appear as n consecutives rows in some element of S. Let 

Rn,m{S) = {(x, y) e Ln{S) X L^{S)\xy e 

If S is sofic, then N{Rn,m[S) = 0(1) (independently of n and m). 

Proof. Let r be the set of Wang tiles that defines S. The protocol is obvious. 
Alice, on input x, chooses a way to tile its part of the space (that may be 
extended into a tiling of an entire half-plane), and sends to Bob what are the 
colors in their common border. Then Bob accepts iff he can tile its part of the 
space (and the rest of the plane) respecting this border condition. □ 

The proposition is not a characterisation. It turns out however that it be¬ 
comes a characterisation for one-dimensional languages. 

Theorem 6. Let S be a one-dimensional shift. Let Ln{S) be the set of words 
of size n of S and 

Rn,m{S) = {(x, y) e L„{S) X L^{S)\xy e An-t-m(*5')} . 

Then S is sofic iff M {Rn,m{S)) = 0(1). 
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Figure 1: A set of Wang tiles and the tilings we obtain. There are countably 
many tilings but only 5 of them up to translation. All tilings except Cij cor¬ 
respond to an inhnite picture with the symbol 0 everywhere, while the tiling 
Cij corresponds to an inhnite picture with the symbol 0 everywhere except on 
position {i,j), which contains the symbol 1. 
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Proof. One direction is clear. Suppose that S is not sofic. To simplify the 
exposition, we assume that S is over the alphabet {0,1}. Let L{S) be the set of 
finite words that might appear in S. If S is not sofic, then L{S) is not regular; 
This implies that there exists a sequence of words (iti)igN in the language of S 
such that all residual sets 

u~^S = {x G {0, \Y\uiX G L{S)} 

are distinct. Wlog we may suppose that words in the sequence (ui) are of 
increasing size, and that Ui is of size at least i. 

Let n be an integer, and denote by k the size of We will prove that there 

exists m so that M{Rk,m) > log log log n, hence ■) is not bounded. 

By definition, for any word u, u~^S = (0it)“^S'U(lM)“^S'. Hence all residual 
sets for words of size less than k can be expressed in terms of residual sets for 
words of size k > n. As ui ... Un (of size less than or equal to k) give n 
different residual sets, this implies in particular that there should be at least 
logn different residual sets for words of size k. 

As there are finitely many residual sets for words of size k, there exists a 
constant m such that if u~^S ^ v~^S, for of size fc, then there exists w of 
size at most m so that uw G S vw ^ S. As any word in S of size less 

than m can be prolonged into a word of S of size to, we may suppose that w is 
of size exactly to. 

We therefore have obtained the following: We have a family vi ... v\ogn in 
Lk{S) for which Hi ^ j, there exists w G Lm{S) so that ViW G S vjw ^ S. 

This implies in particular that in a protocol {Z, Sa, Sb) for Rk,m, each Vi 
must issue a different set of responses z € Z, which implies that > logn. 
This implies that log|Z| > log log logn. This is true for any protocol, hence 
M{Rk,m) > log log logn. □ 

Proposition 15.11 gives some insight into the extension conjecture. 

Definition 5.2. If S is a set of infinite words, let S’^ denote the set of infinite 
pictures, where each row is an element of S, different rows possibly corresponding 
to different elements of S. 

Conjecture 1 (Extension conjecture). S'^ is sofic only if S is sofic. 

This conjecture was proven in some particular cases [niiT] that may be seen 
as instances of a general communication complexity argument that we formulate 
now. 

Indeed, suppose that is sofic. Then by the previous proposition, M{Rn,n{S'^)) 
0(1). But Rn,n{S'^) = Rn,n{SY which means that results from the previous 
section can be applied: A/"(i?„,„(5'Z)) = A/’“®^"‘^(i?„,n(*S')). 

Now, well-known results from Communication Complexity about direct sums 
permit to estimate A/’“®^"*^(i?) from J\f{R): 

Theorem 7 ([B],[H1 Corollary 4.9]). For any relation R C {0,1}™ x {0,1}™, 
we have [R) > Af{R) — log to -I- 0(1). 

Corollary 5.3. If is sofic, then M{Rn,n{S)) < log |E| logn-I-0(1). 

We may replace in the theorem the right-hand term by log |L„(S')| -I- 0(1). 
(Alice and Bob may reject inputs (x,y) that are not in Ln{S) x Ln(S) as they 
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cannot correspond to a valid word xy € L 2 n{S)). This corollary may be seen as 
a reformulation of Proposition 4.3] in a different vocabulary which makes 
the theorem more natural. 

Note also that if S is sofic then Af{Rn,n) = 0(1), which means that possible 
counterexamples to the conjecture entail sets S for which the communication 
complexity is low but nonconstant. 

A specific potential counterexample is mentioned in m- Let S be the sub¬ 
shift whose set of forbidden patterns is .A = {ca'^db^c}. That is, every time the 
pattern caJ^b^c appears in some word of 5", we must have n ^ m. 

For this particular example, we have N{Rn,n{S)) = log |L„(5')| -b 0(1). To 
simplify the exposition, suppose that Alice’s word (of size n) ends with wca^d, 

Bob’s word begins with 6"*c and they want to know whether k ^ m. We will give 
a (well-known) protocol of complexity 1 -bloglogn ~ log |L„(S')|. The following 
nondeterministic protocol can be used: Alice chooses nondeterministically an 
integer i between 1 and logn and sends i and the f-th bit of k to Bob. Then 
Bob tests whether the Ath bit of m is different from what Alice sent. 

Theorem[7]gives only a lower bound and does not preclude that A/’“®^'"P(i?„^„(S')) ^ 0(1). 
However it is well-known in this case FracCov that we have A/’“®^'"P(i?„^„(S')) = 0(1), 
so that Proposition 15. II is not sufficient to treat this particular counterexample. 


6 Open Problems 

The main open question is related to the definition of Communication Com¬ 
plexity as an infimum. We do not know whether a protocol of optimal commu¬ 
nication complexity {J\f{S)) is always possible. 

Section |4] suggests a link between the infinite communication complexity of a 
subshift S and the asymptotic limit of communication complexity of the analog 
problem for finite words. We have specific counterexamples showing the two 
quantities are not always equal, but we conjecture that the finite version is an 
upper bound for the infinite version. 

In terms of tilings, proving the extension conjecture, in particular using 
infinite communication complexity, remains open. 
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